EFFECTS OF DATA QUALITY ON ANALYSIS OF ECOLOGICAL PATTERN USING THE Kˆ (d) STATISTICAL FUNCTION

Size: px
Start display at page:

Download "EFFECTS OF DATA QUALITY ON ANALYSIS OF ECOLOGICAL PATTERN USING THE Kˆ (d) STATISTICAL FUNCTION"

Transcription

1 Ecology, 83(1), 2002, pp by the Ecological Society of America EFFECTS OF DATA QUALITY ON ANALYSIS OF ECOLOGICAL PATTERN USING THE Kˆ (d) STATISTICAL FUNCTION ELIZABETH A. FREEMAN 1 AND E. DAVID FORD 2,3 1 Interdisciplinary Graduate Program in Quantitative Ecology and Resource Management, Box , University of Washington, Seattle, Washington USA 2 College of Forest Resources, Box , and Interdisciplinary Graduate Program in Quantitative Ecology and Resource Management, University of Washington, Seattle, Washington USA Abstract. The Kˆ (d ) function is a summary statistic of all plant plant distances in a mapped area. It offers the potential for detecting both different types and scales of patterns in a single map. Two types of errors occur in maps of individual plants. Data management errors, caused by transcription errors or other mishandling, are large errors and apply to small numbers of plants. Measurement errors, caused by the mapping techniques and equipment, are small errors that apply to all plants. Simulation of known spatial patterns combined with increasing levels of both types of error showed that: (1) data management errors cause the spatial patterns identified by the statistical function Kˆ (d ) to become less significant but do not cause a shift in scale of the identified patterns; and (2) measurement errors caused the spatial patterns identified by Kˆ (d ) to become less significant and to shift to larger scales. The effects of measurement errors are inversely proportional to the scale of interaction between plants on the map. Detection of inhibition between points is more sensitive to measurement error than detection of clustering; detection of small clusters is more sensitive than detection of large clusters; and measurement error tends to cause an overestimation of clumping size. For patterns with inhibition, estimating minimum establishment distance is more sensitive to error than the maximum distance at which inhibition affects survival probability. Two examples of tree spatial distributions from the Wind River Canopy Crane Research Facility stem map data set were analyzed using the Kˆ (d ) function. Clusters of Thuja plicata were detected and were much larger than levels of mapping error identified in the data. Significant inhibition occurs between large (dbh 20 cm) trees of all species at a scale much greater than the level of mapping error. However, the minimum distance of significant inhibition (i.e., the distance within which neighbors are never found) was on the order of the mapping error. Accurate identification of inhibition may not be possible using Kˆ (d ). Key words: aggregation; clumping; data quality; error analysis; inhibition; K(d); point processes; regularity; spatial pattern; spatial statistics; stem map; Thuja plicata. INTRODUCTION Spatial statistics calculated from measured distances between individual plants are increasingly used when studying interactions between plants and when investigating the structure of ecological communities. Single valued statistics reduce the entire spatial pattern to a single number to detect clumping or spatial inhibition. These statistics are based on nearest neighbor distances (Clark and Evans 1954, Pielou 1962, Donnelly 1978) or sample point to nearest neighbor distances (Hopkins 1954, Pielou 1959). From these statistics, it was proposed that single species forest stands (Laesele 1965) and desert vegetation (Phillips and MacMahon 1981) shift over time from clumped patterns of seedlings and young plants towards regularity due to competition. However, single value statistics are limited. They Manuscript received 7 October 1999; revised 10 November 2000; accepted 14 December 2000; final version received 9 March Address correspondence to this author. edford@u.washington.edu. 35 give no information on the spatial scales of patterns, nor do they provide information such as size of clusters or the distance of strongest inhibition. Most importantly they cannot define mixed patterns (Pielou 1962; e.g., inhibition at small scales within larger scale clumping). Increasingly the Kˆ (d ) function (Ripley 1977, Diggle 1983, Haase 1995) is being used for these purposes. Kˆ (d ) is a transformation of all interplant distances that is graphed against d. Software (e.g., MathSoft 1996) is available for calculating Kˆ (d ). The basic assumptions of Kˆ (d ) analysis are that: (1) the individual plant is the elementary sampling unit and defines grain size (Legendre and Legendre 1998: ); and (2) the size of the mapped area defines the extent of the sample. Kˆ (d ) describes the distribution of intervals between the plants on the sample area and, unless otherwise mentioned, scale refers to the size of these intervals. Ecologists using Kˆ (d ), or its square root transformation, Lˆ(d ) (Besag 1977, Diggle 1983) have found more complex spatial patterns than simply clumped,

2 36 ELIZABETH A. FREEMAN AND E. DAVID FORD Ecology, Vol. 83, No. 1 random, or regular, and have used these results to make inferences about pattern forming processes. For example, Sterner et al. (1986) measured clumping in juveniles of three tropical tree species. The spatial pattern of adults was significantly more uniform than could be explained by random thinning of the juvenile clumps. Sterner et al. suggested that this pattern of mortality is consistent with the Connell Janzen hypothesis for the maintenance of high species richness in tropical forests (Connell 1970, Janzen 1970). Kenkel (1988) used Lˆ(d ) analysis, along with a refined nearest neighbor analysis, to study self-thinning in pure stands of Pinus banksiana. Based on changes in spatial pattern and stand density with increasing stand age, he suggested that two distinct competitive phases occurred: an early scramble phase of two-sided competition for soil resources, and a later one-sided competition for light. In both the Sterner et al. (1986) and Kenkel (1988) analyses, the identification of regularity within the spatial pattern indicating competition was crucial to the ecological interpretations. Lˆ(d ) is also being used in ecological modeling to define the structure of the model and to estimate parameter values. See Moeur (1993, 1997) for old-growth forest, Kenkel (1993) for the clonal herb Aralia nudicaulis, and Batista and Maguire (1998) for regenerating tropical forest. A crucial feature of each model is estimation of spatial inhibition. For detailed analysis of spatial pattern, one needs to know the accuracy of Lˆ(d ). This accuracy depends upon the accuracy of the mapped data. Several mapping techniques have been reported for stands of trees. To calculate tree positions Kenkel (1988) first established a rectangular grid and then measured the distribution of individuals within each grid segment to each of its corners. Kenkel (1993) used a similar procedure for mapping individuals of the herb A. nudicaulis in 5 5 m plots. Moeur (1993) cruised through sample plots by establishing individual reference points from which tree azimuths and distances were surveyed. Research into developing field mapping techniques for individual plants has focussed on attaining a balance between the speed of a survey and the accuracy of the map obtained. Rohlf and Archie (1978) proposed a rapid technique based on progressive mapping of individuals relative to previously mapped trees, without establishing and surveying plot corners. Hall (1991) checked this procedure, using both a grid method and by simulation techniques, and concluded that propagation of errors was a serious problem and that locating errors can be nearly impossible. Boose et al. (1998) reported improvements on this method and found that errors in tree locations depended on how accurately the distances were measured between trees and on the number of trees on the plot. In this paper we investigate how the type and magnitude of errors that might be acceptable depend upon both the type and scale of the pattern of interest. We examine data management errors and measurement errors and calculate their effects on the identification of spatial patterns using Lˆ(d ) as well as their effects on detecting and defining cluster and inhibition patterns. We use both simulations and data examples from an old-growth Pseudotsuga menziesii Tsuga heterophylla forest (Franklin and Dyrness 1988). METHODS Strategy of investigation using simulation We simulated data points in order to specify both clustered and inhibition patterns. We modified individual points to simulate mapping errors. By comparing Lˆ(d ) from the modified data points to Lˆ(d ) from the original data points the effects of the simulated errors on the Lˆ(d ) transformation of Kˆ (d ) were examined. Both data management errors and measurement errors were found in the Wind River Canopy Crane Research Facility (WRCCRF) stem map data set (Freeman 1997). These were simulated, respectively, by replacement of points and by moving points specific distances according to the error. Results are given both as a summary of all the simulations, and by a single example of each of the four combinations of spatial patterns (clustered vs. inhibited) and error type (measurement error vs. data management error). The complete set of simulations of different types of pattern combined with different amounts of error is presented in Freeman (1997). Simulated spatial patterns The simulated plot was a unit square of area A, with n 100 points. A toroidal wrap-around of the square boundaries was used so that edge effects were not a concern. Poisson cluster process. In a Poisson cluster process, parent events are distributed randomly each forming a cluster center assigned a random number of n offspring. Offspring are spatially distributed around their parent with a bivariate normal distribution with standard deviation cluster. A range of cluster sizes was investigated: cluster 0.025, 0.05, 0.075, and On the m Wind River Canopy Crane Research Facility (WRCCRF) stem map this was equivalent to cluster 5 m, 10 m, 15 m, and 20 m. Note that cluster is the standard deviation of the distance between each offspring and its parent, so cluster size is 2 cluster. Markov point inhibition process. A Markov point inhibition process assumes that only its potential neighbors affect the probability that a new point can become established. There is a cutoff distance, a, where, if any potential neighbor is closer than this distance then establishment is prohibited. There is a large distance, b, defining the outer limit of a plant s inhibitory effect. In this study, the probability of establishment, given a neighbor between the distances a and b, was chosen to be a linear function of distance. The minimum establishment distance was set at a and the maxi-

3 January 2002 DATA QUALITY EFFECTS ON PATTERN ANALYSIS 37 mum distance of inhibition varied over b 0.05, 0.10, 0.15, and On the WRCCRF stem map this was equivalent to a 1 m and b 10 m, 20 m, 30 m, and 40 m. Complete spatial randomness. Complete spatial randomness (CSR) was used, as an aid in the simulation procedures, as part of the error simulation, and as a null hypothesis in Monte Carlo tests. CSR assumes the mean number of points in any region A with area A follows a Poisson distribution with mean A, and given n points in A, the points are an independent random sample from the uniform distribution on A (Diggle 1983). The two types of errors Data management errors. These include errors made in the field (e.g., measuring the wrong plant or misidentifying a plant) as well as errors in data handling (e.g., incorrectly transcribing data, where degrees azimuth may be recorded as degrees or as degrees). Data management errors can be any size, but generally affect few data points. Measurement errors. These are inherent in mapping techniques and equipment. Their magnitude depends on the equipment used. A tree map made with a hand held compass and a measuring tape is likely to have greater absolute measurement errors than one made with a tripod based electronic theodolite and laser range finder. Mapping techniques also influence the magnitude of these errors. If a grid is established first, and individuals are mapped relative to the grid points, cumulative errors are minimized. If individuals are mapped sequentially, each relative to previously mapped individuals, large errors can accumulate (Hall 1991), although Boose et al. (1998) describe a procedure and calculation method for minimizing errors. Generally, measurement errors are small when a grid is used, but they can potentially affect all data points. A note on absolute vs. relative errors Error analysis of a mapped data set reveals absolute levels of error (e.g., the number of displaced points and the limits to mapping accuracy of correctly placed points). In this study, however, we consider relative error. For data management error, relative error is the number of displaced points relative to the total number of trees. This is straightforward to calculate, as it does not depend on the underlying, and possibly unknown, spatial pattern. For measurement error, relative error is the limit of mapping accuracy of correctly placed points, relative to the scale of the underlying spatial pattern. In simulations, the scale of the pattern is known, and thus the relative error can be calculated. For actual stem maps, the underlying scale is unknown; after all, discovering this scale may be the purpose of making the map. However, the measurement error can still be calculated relative to the scale of interest. For example, when investigating inhibition processes, the interactions of interest are on a smaller scale than when examining cluster processes. Therefore, any mapping errors will be relatively larger, compared to small-scale patterns. Note that in distributions with more than one parameter, such as in Markov point processes, the error can be calculated relative to each parameter. Examples of absolute error levels would be: in the preliminary survey data management errors occurred in 11 out of 253 trees, or the measurement errors have a standard deviation of 0.43 m. Relative error levels would be: 4.3% of the points are subject to data management errors, or the standard deviation of the measurement error is 67% of the predicted inhibition distance. Simulated errors Data management errors. Data management errors were simulated by creating large errors in location in a percentage of the data points. These points were randomly selected, and the true points were replaced with randomly located points to simulate poor data management. This is equivalent to a few plants of the wrong species being included in a map, or some of the mapping data being misrecorded or mistranscribed. The percentage of replacements was set to 0, 5, 10, and 20. This range is likely to be wider than most data management errors where the objects can be clearly identified and mapped (e.g., individual trees in a single species stand), but it was used to cover situations where data management can be difficult. Primarily we used random points for replacement. There are many possible combinations of original distribution and replacement distribution, and examining all of them was prohibitive. We did, however, investigate two combinations: clustered distributions with replacement from an inhibited distribution with a and b 0.20; and inhibited distributions with replacement from a clustered distribution with cluster This is equivalent to mistakenly including a few trees of the wrong species, with a different spatial pattern, in a map. Measurement errors. Measurement errors were simulated as small errors in location applied to all data points. Each point was shifted a small distance based on a bivariate normal distribution centered on the point, with a standard deviation error 0, 0.025, 0.050, and Translating from the simulation to a square the size of the WRCCRF stem map (200 m on a side), this was equivalent to absolute errors of error 0m,5m, 10 m, and 20 m. These were several orders of magnitude larger than the measurement errors in the WRCCRF stem map. However, errors of several meters may be realistic for a tree stem map of a large area made with a tape measure and hand compass without using a fixed grid system. Also, in the investigation of grasslands, for example, absolute errors may be low due the small distances between individuals, while relative error may be high.

4 38 ELIZABETH A. FREEMAN AND E. DAVID FORD Ecology, Vol. 83, No. 1 Simulations were made on the unit square with error, the error relative to this unit square. The measurement error relative to the scale of the underlying pattern is the ratio of error to the scale of the pattern. For clustered patterns, cluster size was 2 cluster. This is because spatial analysis with Kˆ (d ) is related to the diameter of the clusters, and cluster is related to the radius of the clusters. Inhibited patterns were generated with a Markov point process, with parameters a and b, and relative error was calculated for each parameter. We made 100 simulations for each combination of parameter value used in defining pattern with either replacement rate for data management errors or error for measurement error. Spatial analysis of simulated data The Kˆ (d ) statistic. Kˆ (d ), a second order parameter, considers all distances between all individual points, not just the distances between nearest neighbors. Consequently, it gives more information on the scale of the patterns than do first order tests such as G (the cumulative frequency distribution of nearest neighbor distances) or F (the cumulative frequency distribution of arbitrary points to nearest neighbors; see Diggle 1983). K(d ) is the expected number of points within a given distance of an arbitrary point. This is calculated relative to the density of points: K(d) [E(number of points within distance d of an arbitrary point)]/(the density of points) where d distance. Estimation is by ˆK(d) (area of the plot)(number of distances less than d) (total number of points) 2 L(d ) is a square root transformation of K(d ) suggested by Besag (1977): 1/2 K(d) L(d) d. L(d ) linearizes K(d ), stabilizes the variance, and under complete spatial randomness the expected value of L(d ) is approximately zero. It can be difficult to interpret K(d ) visually. Using L(d ) makes graphs easier to read. Examples for different types of pattern can be found in Diggle (1983) and practical application in Kenkel (1993) and Moeur (1993). Lˆ(d ) is calculated for a sequence of distances. As d approaches the size of A, the results of Lˆ(d ) reflect edge effects rather than the spatial patterns of the data (Haase 1995). In this study, Lˆ(d ) was calculated at one hundred distances, evenly spaced between d 0 and d 0.5. For simulated inhibited patterns, analysis was carried out on the distances between d 0 and d 0.3. The effect of inhibition on Lˆ(d ) is primarily governed by interactions between nearby plants, while the effect of clustering on Lˆ(d ) is governed by size of the clusters. Thus inhibition is shown by Lˆ(d ) at smaller scales than is clustering (Fig. 1). The results of Lˆ(d ) are then plotted against distance. Significance is usually evaluated by comparing the observed data with Monte Carlo envelopes from multiple simulations of the null distribution, usually complete spatial randomness (CSR). Each simulation generates an Lˆ(d ) function (Fig. 2a). The repeated series of simulations each generate a further Lˆ(d ) function and a probability envelope is calculated from their highest and lowest values (Fig. 2d). Typically an envelope is calculated from the Lˆ(d ) of 99 simulations and if the Lˆ(d ) of a pattern has some part outside of that envelope it is judged to be a significant departure from CSR. Two points had to be considered in using Monte Carlo methods to assess the result of our simulations of the interaction between pattern and error. First, the Monte Carlo envelope typically has irregularities in the bounds it defines (Fig. 2d), and repeat constructions produce similar but not identical envelopes; therefore, the envelope provides a guide for assessment but not a strict test. Because of this it is inappropriate to generate a separate Monte Carlo Lˆ(d ) envelope for each simulation and expect to use each one as a rigorous arbiter of significant departure from CSR or not. As a general guide for the 100-point distributions, 99 simulations gave values of Lˆ(d ) within the limit of 0.01 to 0.01 (Fig. 2d). Second, and particularly important for this work, errors may not only change the magnitude of the pattern, but also shift the scale. Assessment of error must be made through analysis of multiple simulations of the interaction between pattern and error, both of them stochastic processes. One hundred simulations were made of each combination. The mean point was calculated for these simulations at which Lˆ(d ) reached its extreme value (Mean Extreme Value or MEV). This MEV is a maximum for clustered patterns (Fig. 3a), and a minimum for inhibited patterns (Fig. 3b). Spatial analysis of forest data The WRCCRF was established in the Thornton T. Munger Research Natural Area, in southern Washington state near the Columbia River Gorge ( N, W) to give researchers access to the upper canopies of a low elevation old-growth Douglas fir Western hemlock forest with canopy trees 500 yr old. DeBell and Franklin (1987) and Franklin and DeBell (1988) provide background information on stand dynamics at this site. A 4-ha square with edges running north south and east west, and with the crane at its center point, contained 2168 trees with dbh 5 cm, including snags, divided among nine species. The most common species were Western hemlock Tsuga

5 January 2002 DATA QUALITY EFFECTS ON PATTERN ANALYSIS 39 FIG. 1. Examples of simulated point patterns on the unit square and their corresponding calculated Lˆ(d ) functions: (a, b) random, (c, d) clustered, and (e, f) inhibited. heterophylla ((Raf.) Sarg.), Pacific yew Taxus brevifolia (Nutt.), Douglas fir Pseudotsuga menziesii ((Mirb.) Franco), and Western red cedar Thuja plicata (Donn ex. D. Don). Professional surveyors surveyed a 25-m grid on the 4-ha study site. All trees with dbh 5 cm were numbered and tagged, and species and dbh were recorded. Tree locations were mapped from surrounding grid points using a Criterion 400 Survey Laser (Laser Technologies, Englewood, Colorado, USA) to measure azimuth angle, horizontal distance, and inclination. Eighty-five percent of the trees were measured from two grid points. In some cases only one measurement set was possible due to heavy underbrush or other trees. Measurements were converted to X and Y coordinates. To estimate the level of measurement errors, 37 trees located in two grid squares were each mapped from three grid points. Measuring from two grid points increased the time involved in collecting the data. It did not make a great difference to the standard deviation of the measurement errors, but it did reveal data management errors. Errors FIG. 2. Procedure for generating a Monte Carlo envelope for Lˆ(d ). (a) One simulation of complete spatial randomness (CSR) for n points. (b) A second simulation of CSR for n points added to the first simulation. (c) A third simulation of CSR for n points added. (d) Ninety-nine simulations of CSR and the resulting Monte Carlo envelope (stippled line). Typically a test is constructed by considering data as the 100th case and examining whether it exceeds the envelope at any scale.

6 40 ELIZABETH A. FREEMAN AND E. DAVID FORD Ecology, Vol. 83, No. 1 FIG. 3. Calculation of Mean Extreme Value for sets of clustered or inhibited patterns generated by the same process. (a) For clustered patterns, the maximum values of each simulation (open circles) are averaged to find the mean of the maximum values (solid circle). (b) For inhibited patterns, the minimum values of each simulation (open circles) are averaged to find the mean of the minimum values (solid circle). for 98 trees (4.5%) were caught in this manner when locations that were calculated from two surveyed positions differed substantially. By assuming that measurement error followed a bivariate normal distribution, with no correlation between position of the point and size of the error, the within tree standard deviation was m when trees were mapped from at least two grid points. Points mapped from the mean of two measurements would follow a bivariate normal distribution with error m. One theory of competition states that the distribution of plants becomes more spatially even over time, since plants growing close together are subject to increased competition, and thus to increased mortality (Ford 1975, Sterner et al. 1986, Kenkel 1988, 1993, Kenkel et al. 1997). In this forest, the older, and thus larger, trees are assumed to be survivors of such competition, and thus would be more likely to show the effects of inhibition as found by Moeur (1993) in an old-growth coniferous forest. The WRCCRF has a policy of strictly limiting destructive sampling, and cores to age trees could not be taken. RESULTS The effects of errors on the mean extreme values Data management error. Generally the detection of both clustering and inhibition using Lˆ(d ) is robust to data management error. The addition of simulated data management errors, drawn from a complete spatial randomness (CSR) distribution, to clustered patterns reduced the amplitude of the maximum values of Lˆ(d ) but did not shift the maximum to a larger scale (Fig. 4). Clustering was still detected even when 20% of the points had been replaced from CSR (i.e., Lˆ(d )atthe distance of 0.10 was much greater than 0.01). Similarly, simulated data management errors added to inhibited patterns made minimum values of Lˆ(d ) less extreme, though they remained much less than 0.01, and shifted them slightly to larger scales (Fig. 5). Clustered patterns with inhibited replacement showed little difference from CSR replacement at replacement rates of up to 20%. The inhibited patterns with clustered replacement were affected more strongly than with CSR replacement, but only after replacement rate reached 20%. The Mean Extreme Value (MEV) of Lˆ(d ) was calculated for 100 simulations of each of four clustered and four inhibited patterns, each combined with four levels of simulated data management errors from a CSR distribution. The change in MEV with increasing simulated error for both clustered (Fig. 6a) and inhibited (Fig. 6b) patterns was generally small relative to that found for measurement error (Fig. 6c, d). Clustered patterns decreased in the absolute value of the MEV, but the value was always substantially 0.01, and no consistent change in the estimated scale of pattern was shown. For inhibited patterns the absolute value of the MEV increased but did not reach With inhibited patterns, the MEV did show a slight shift to larger scales, but not nearly to the extent that they shifted with measurement errors. Measurement error. The addition of measurement errors over the range of simulated patterns had substantially greater effects than data management errors. The apparent size of clusters increased (Fig. 7a, c, e, and g) and maximum values of Lˆ(d ) decreased, becoming closer to CSR, and shifting to larger scales (Fig. 7b, d, f, and h). The shift in scale was present in all four clustered patterns (Fig. 6). It was more obvious,

7 January 2002 DATA QUALITY EFFECTS ON PATTERN ANALYSIS 41 FIG. 4. Data management error applied to a clustered pattern ( cluster 0.10), illustrating how increasing data management errors cause the corresponding maximum value of Lˆ(d ) to lose amplitude, but not to move to larger distances. Solid circles maximum Lˆ(d ); panel (a) corresponds with (b), (c) with (d), and so forth. Calculations are based on the unit square. and showed up at smaller levels of error, in the distributions with smaller clusters, cluster 0.025, 0.05, as the level of error relative to the cluster sizes was larger in these distributions. The shift was most pronounced when error, the standard deviation of the measurement error, was 2 cluster, the scale of the clusters (i.e., when the error relative to the cluster size was 100%; see Fig. 6c). For addition of measurement error to an inhibited distribution the minimum values of Lˆ(d ) increased, to become closer to CSR, and shifted to larger scales (Fig. 8). Note that when looking only at the MEV, it would appear that measurement error has a greater effect when b (the maximum distance at which establishment is affected by neighbors) is large (i.e., when inhibition is occurring at greater distances). However this is partly an artifact of the summarizing process. MEV provides a useful summary but is not appropriate when the level of error completely obscures the underlying pattern (i.e., once the apparent distribution resembles CSR). When the distribution approaches CSR, MEV lies at the midpoint of the distances examined. Thus in this case (where Lˆ(d ) was calculated from d 0 0.3), MEV for distributions resembling CSR tended to be located near d 0.15 (Fig. 6d). As a result, for the smallest radius of inhibition influence tested (i.e., b 0.05), as measurement errors increased to a scale larger than b (i.e., from error 0.05 to error 0.10), MEV actually reversed its trend, and shifted slightly towards smaller scales. Thus when examining the effect of measurement errors on inhibited distributions, the most sensitive combination of error and distribution types, it was nec- FIG. 5. Data management error applied to an inhibited pattern generated as a Markov point inhibition process with model parameters a 0.005, b Data management errors cause the minimum value of Lˆ(d ) to lose amplitude, but do not move to larger distances. Solid circles minimum Lˆ(d ); panel (a) corresponds with (b), (c) with (d), and so forth. Calculations are based on the unit square.

8 42 ELIZABETH A. FREEMAN AND E. DAVID FORD Ecology, Vol. 83, No. 1 FIG. 6. Effects on Mean Extreme Value of Lˆ(d ) calculated from 100 simulations of addition of (a, b) data management error or (c, d) measurement error, on (a, c) clustered and (b, d) inhibited patterns. For clustered patterns, cluster was varied: 0.05 ( ), 0.10 ( ), 0.15 ( ), or 0.20 ( ). For inhibited patterns the parameter a of the Markov point process was held constant at 0.005, while parameter b varied: 0.05 ( ), 0.10 ( ), 0.15 ( ), or 0.20 ( ). Data management error was simulated by replacing a percentage of points using complete spatial randomness (CSR): solid symbols, no change; horizontal hatching, 5% change; vertical hatching, 15% change; open symbols, 20% change. Measurement error was simulated by addition of error from a bivariate normal distribution with error to all points: solid symbols, no change; horizontal hatching, 0.025; vertical hatching, 0.05; open symbols, FIG. 7. Measurement error applied to a clustered pattern ( cluster 0.10), illustrating how increasing data measurement errors cause the corresponding maximum value of Lˆ(d ) to lose amplitude and move to larger distances. Solid circles maximum Lˆ(d ). Calculations are based on the unit square.

9 January 2002 DATA QUALITY EFFECTS ON PATTERN ANALYSIS 43 FIG. 8. Measurement error applied to an inhibited pattern generated as a Markov point inhibition process with model parameters a 0.005, b Measurement errors cause the minimum value of Lˆ(d ) to lose amplitude and move to larger distances. Solid circles minimum Lˆ(d ). Calculations are based on the unit square. essary to look at the unsummarized data, an example of which is shown in Fig. 8. When b is small, measurement errors are larger relative to the scale of the inhibition, and thus have a greater effect on Lˆ(d ). Once error (the standard deviation of the measurement error) approaches b (i.e., the measurement error relative to b approaches 100%) it is impossible to distinguish between the mapped pattern and CSR. Even at levels of error as small as 20% of b, the regular pattern resulting from an inhibition process is more sensitive to the effects of error than clustered patterns. Often Lˆ(d ) will no longer indicate inhibition at small distances, while still showing inhibition at larger scales. When looking at small distances, the relative error is larger, and thus errors tend to affect small distances first. Generally detection, and particularly determination of scale, of both clustered and inhibited processes using Lˆ(d ) is less robust to measurement error than to data management error. Inhibition patterns and small scale clustering are more sensitive than large scale clustering. Standard deviation of Mean Extreme Value (MEV) The type of simulated error had little effect on the variance of the extreme value of Lˆ(d ), or of the variance of the distance at which this extreme value occurred. The standard deviation of the MEV ranged from to for clustered distributions, and from to for inhibited distributions. These values are much smaller than the differences caused by either type of error. The standard deviation of the mean distance at which MEV occurred ranged from to for clustered distributions, and from to for inhibited distributions. These values are much smaller than the differences caused by measurement error, but on the order of the differences caused by data management error. Kˆ (d ) applied to the WRCCRF stem map Spatial analysis of all nine canopy tree species at the WRCCRF was conducted (Freeman 1997). Here we choose two examples; live Thuja plicata for investigating clustered distributions, and all live trees 20 cm dbh, for investigating inhibition patterns. Clustering is the dominant spatial pattern of individuals of single species (Freeman 1997). Clustering may be the result of a parent child process with small seed spread distances. Also some microclimate variations may favor a single species and produce clusters. We chose T. plicata for illustration of the potential effects of errors because of its reasonable sample size (n 121), and its lack of a visually obvious pattern in its tree map. Thuja plicata shows significant clustering at medium to large scales. Living trees are clustered from 10 m to 60 m, with marginal clustering at smaller scales, and random distributions at larger scales (Fig. 9a). The cluster size is considerably greater than the level of measurement error (within tree standard deviation m for trees located using one survey point, m for trees located with two). Thus measurement error has little effect on detection or estimate of cluster size even if trees were mapped from only one grid point. Live trees with dbh 20 cm show inhibition between 0.4 m and 13.2 m (Fig. 9b). The minimum intertree distance was 0.32 m. Levels of measurement error for this data have little effect on detecting the upper bound of the inhibition. However, in the simulations, detection of inhibition began to disappear with levels of relative measurement error as small as 20% and was entirely lost when the relative error was 100%. For the data of live trees of all species with dbh 20 cm inhibition occurring at distances m (1 5 times the measurement error) could be obscured by measurement

10 44 ELIZABETH A. FREEMAN AND E. DAVID FORD Ecology, Vol. 83, No. 1 FIG. 9. Lˆ(d ) for subsets of the WRCCRF stem map: (a) Western red cedar, (b) all trees 20 cm dbh. The subset by species shows significant clustering at medium to large scales (10 60 m), while the subset by size shows significant inhibition at small scales ( m). error. If a single grid point had been used for mapping tree locations, inhibition at distances m would not have been detected. For this map, Kˆ (d ) can be used effectively to detect the outer range of competitive influence, b. But the minimum distance between trees with dbh 20 cm was 0.32 m. Thus the levels of error found reduced the accuracy of estimations of the minimum establishment distance. Furthermore, data of this quality would be unsuitable for detecting the onset of competition process among small trees such as clumps of young natural regeneration that grow at smaller neighbor distances. DISCUSSION Data management error. Data management errors reduced the strength of identification of patterns by the Lˆ(d ) transformation of Kˆ (d ) but this type of error did not shift the identification of the scale of the patterns, even at relatively large error levels, such as 20% replacement. Data management error was simulated with a two-stage process: first, points were removed from the pattern; second, they were replaced with random locations. Clustered distributions are characterized by large numbers of small interpoint distances. Removing points from these patterns also removed some of these small distances, causing clusters to become less dense, but the estimated size of the clusters was not affected. Random replacement added a few new large distances, but their contribution to the perceived scale of the clusters was negligible when compared to that of the original clusters. Inhibited distributions are characterized by the lack of small interpoint distances, and the removal of points has little effect on this characteristic. Random replacement added a few new small interpoint distances, and thus reduced the perceived strength of the inhibition, but again the scale did not shift. Nonrandom replacement at rates 20% showed little difference from random replacement. With replacement rates 20%, pattern identification using Kˆ (d ) became less clear. With these higher replacement rates, inhibited distributions were more affected than clustered distributions. However if actual field data has this amount of data management error it is probably time to rethink the study. We only looked at two combinations of spatial pattern and error distributions. It is possible that other combinations, such as replacing points with the same pattern but at a different scale, may have greater effects. This should be kept in mind if this type of error may be significant in a study. Measurement error. Addition of measurement er-

11 January 2002 DATA QUALITY EFFECTS ON PATTERN ANALYSIS 45 rors reduced the amplitude of Lˆ(d ) and the MEV peaks to larger scales. This effect was strongest when the scale of errors approached the scale of the underlying pattern. Because of this (1) inhibition is more sensitive than clustering, (2) small clusters are more sensitive than large clusters, and (3) in inhibition, the minimum establishment distance was more sensitive than the maximum inhibition. Both clustered and inhibited patterns showed an apparent shift to larger distances with the addition of measurement error. Inhibited patterns became indistinguishable from randomness at lower error levels than clustered patterns. In this study the inhibited patterns were defined by two parameters; a, the distance below which no nearest neighbor distances are found, and b, the maximum distance at which an inhibition effect occurs, with a smaller than b. While b was of the same scale as 2 cluster, a was much smaller. Thus error relative to a was much larger than error relative to b and 2 cluster. Also while measurement error caused peak maximum deviation of Lˆ(d ) from zero for both clustered and inhibited patterns to shift to larger scales, the causes of this shift were different for the two distributions. In clustered patterns, measurement error actually caused the perceived size of clusters to increase (Fig. 7). In inhibited patterns, however, the addition of measurement error did not increase the perceived inhibition distance. Instead, it obscured small-scale inhibition while leaving visible larger scale inhibition already present in the data (Fig. 8). To understand this effect, the characteristics of clustering and inhibition must be considered, and how these are reflected in Lˆ(d ). For clustered patterns, the shape of Lˆ(d ) is determined by the increased number of small interpoint distances relative to randomness. For example, if there are clusters 20 m in diameter, then there are an increased number of interpoint distances between 0 m and 20 m. If measurement errors are added to all the points, even measurement errors as large as error 10mor error 20 m, there are still clusters. They just look larger and less concentrated. Some of the clusters may blur into each other, but this will just form still larger clusters. When Lˆ(d ) is calculated from this data, the maximum value is less, due to the lower density of trees within the clusters, and the peak shifts to larger scales, due to the larger size of the clusters. However Lˆ(d ) still shows clustering. On the other hand, Lˆ(d ) for inhibited patterns is characterized by the absence of small interpoint distances relative to complete spatial randomness (CSR). If, for example, there is gradually increasing inhibition between 20 m and 0 m, then even a level of measurement error as small as error 2mor error 1 m will move points sufficiently to increase the number of interpoint distances markedly. Then Lˆ(d ) will not show the smallest scale inhibition. If the maximum inhibition distance is greater than the scale of measurement error, there will still be some evidence of inhibition at the larger distances; but once the level of measurement error reaches the scale of the inhibition, nothing will be left of the pattern. Wind River Canopy Crane Research Facility stem map. In the WRCCRF stem map, mapping each tree from two grid points made it possible to catch data management errors by comparing the two tree locations for discrepancies. This technique reduced data management errors by 4.5%. But both clustered and inhibited patterns are relatively insensitive to data management errors 20%, and these errors would have had little effect on the results of Lˆ(d ). However measurement errors play a bigger role. At levels of relative measurement errors on the order of those found in the WRCCRF stem map, the increase in measurement accuracy from the second grid point may be important. This is particularly so if the researcher is interested in estimating the minimum establishment distance for an inhibition process, or in establishing the onset of possible inhibition among small trees. RECOMMENDATIONS (1) Use at least two measurements to locate each plant. In this study the second measurements on each tree made at the WRCCRF minimized the fieldwork required to correct data management errors once after they were identified. With two or more measurements to locate each plant it is simply necessary to determine which measurement is incorrect, and then map the plant from the remaining measurements. If plants are surveyed only from one measurement, and it is flawed, the false location may not be discovered; and if it is discovered, then it is necessary to resurvey the plant in the field. (2) Define absolute measurement error for mapped points under field conditions. It is important not to assume there is no measurement error. The type and size of errors depend upon the survey procedure and instruments used. Two techniques can be valuable; (a) a more comprehensive survey of a subplot, and/or one using more accurate equipment, to asses error when reduced measurements are used, and (b) repeated survey of a subplot. However these techniques may not enable estimation of cumulative errors over a large survey area. (3) Define relative error for the scale of pattern of interest. Necessary map accuracy is a function of the question being asked. A map of individual plant locations cannot be used to investigate processes occurring at scales that approach the accuracy of the measurements. If a map is only going to be used to investigate large-scale processes, such as clustering, then it does not need to be as accurate as a map that will be used to investigate small-scale processes such as inhibition. However use of Kˆ (d ) in ecology has frequently been concerned with detecting inhibition and defining its scale. If an individual plant location map is used for a general investigation of all spatial processes oc-

12 46 ELIZABETH A. FREEMAN AND E. DAVID FORD Ecology, Vol. 83, No. 1 curring in the community, it should be kept in mind that any spatial processes approaching the scale of the mapping accuracy might not be revealed by Kˆ (d ). When errors are considered, Kˆ (d ) is not equally powerful for all values of d. Relative error is greater for small values of d, so there is a greater chance of accepting the null hypothesis that Kˆ (d ) does not differ from complete spatial randomness, when in fact it does (Type II error). ACKNOWLEDGMENTS We are grateful to Dr. David Shaw and his colleagues at the WRCCRF for having facilitated the mapping of trees, and to Dr. Peter Guttorp for helpful suggestions during analysis. Research was performed in whole or in part at the Wind River Canopy Crane Research Facility located in the Thornton T. Munger Research Natural Area in Washington State, USA. The facility is a cooperative scientific venture among the University of Washington, the U.S. Forest Service Pacific Northwest Research Station, and the Gifford Pinchot National Forest. Although the research described in this article has been funded in part by the U.S. Environmental Protection Agency through agreement CR to the University of Washington, it has not been subjected to the Agency s required peer and policy review, and therefore does not necessarily reflect the views of the Agency and no official endorsement should be inferred. LITERATURE CITED Batista, J. L. F., and D. A. Maguire Modeling the spatial structure of tropical forests. Forest Ecology and Management 110: Besag, J Contribution to the discussion of Dr. Ripley s paper. Journal of the Royal Statistical Society B 39: Boose, E. R., E. F. Boose, and A. L. Lezberg A practical method for mapping trees using distance measurements. Ecology 79: Clark, P. J., and F. C. Evans Distance to nearest neighbor as a measure of spatial relationships in a population. Ecology 35: Connell, J. H On the role of natural enemies in preventing competitive exclusion in some marine animals and in rain forest tree diversity. Pages in P. J. den Boer and G. R. Gradwell, editors. Dynamics of numbers in populations. Proceedings of the Advanced Study Institute, Centre for Agricultural Publication and Documentation, Wageningen, The Netherlands. DeBell, D. S., and J. F. Franklin Old-growth Douglasfir and Western hemlock: a 36 year record of growth and mortality. Western Journal of Applied Forestry 2(4): Diggle, P. J Statistical analysis of spatial point patterns. Academic Press, London, UK. Donnelly, K Simulations to determine the variance and edge-effect of total nearest neighbor distance. Pages in I. Hodder, editor. Simulation methods in archeology. Cambridge University Press, London, UK. Ford, E. D Competition and stand structure in some even-aged plant monocultures. Journal of Ecology 63: Franklin, J. F., and D. S. DeBell Thirty-six years of tree population change in an old-growth Pseudotsuga Tsuga forest. Canadian Journal of Forest Research 18: Franklin, J. F., and C. T. Dyrness Natural vegetation of Oregon and Washington. Oregon State University Press, Corvallis, Oregon, USA. Freeman, E. A The effects of data quality on spatial statistics. Thesis. University of Washington, Seattle, Washington, USA. Haase, P Spatial pattern analysis in ecology based on Ripley s K-function: introduction and methods of edge correction. Journal of Vegetation Science 6: Hall, R. B. W A re-examination of the use of interpoint distances and least squares in mapping forest trees. Ecology 72: Hopkins, B A new method for determining the type of distribution of plant individuals. Annals of Botany 18: Janzen, D. H Herbivores and the number of tree species in tropical forests. American Naturalist 104: Kenkel, N. C Pattern of self-thinning in jack pine: testing the random mortality hypothesis. Ecology 69: Kenkel, N. C Modeling Markovian dependence in populations of Aralia nudicaulis. Ecology 74: Kenkel, N. C., M. L. Hendrie, and I. F. Bella A longterm study of Pinus banksiana population dynamics. Journal of Vegetation Science 8: Laesele, A. M Spacing and competition in natural stands of sand pine. Ecology 46: Legendre, P., and L. Legendre Numerical ecology. Second English edition. Elsevier Science, Amsterdam, The Netherlands. MathSoft S SPATIALSTATS User s manual, Version 1.0. MathSoft, Seattle, Washington, USA. Moeur, M Characterizing spatial patterns of trees using stem-mapped data. Forest Science 39: Moeur, M Spatial models of competition and gap dynamics in old-growth Tsuga heterophylla Thuja plicata. Forest Ecology and Management 94: Phillips, D. L., and J. A. MacMahon Competition and spacing patterns in desert shrubs. Journal of Ecology 69: Pielou, E. C The use of point plant distances in the study of the pattern of plant populations. Journal of Ecology 47: Pielou, E. C The use of plant-to-neighbour distances for the detection of competition. Journal of Ecology 50: Ripley, B. D Modeling spatial patterns (with discussion). Journal of the Royal Statistical Society. Series B 39: Rohlf, F. J., and J. W. Archie Least-squares mapping using interpoint distances. Ecology 59: Sterner, R. W., C. A. Ribic, and G. E. Schatz Testing for life history changes in spatial patterns of four tropical tree species. Journal of Ecology 74:

An Introduction to Point Pattern Analysis using CrimeStat

An Introduction to Point Pattern Analysis using CrimeStat Introduction An Introduction to Point Pattern Analysis using CrimeStat Luc Anselin Spatial Analysis Laboratory Department of Agricultural and Consumer Economics University of Illinois, Urbana-Champaign

More information

VOLATILITY AND DEVIATION OF DISTRIBUTED SOLAR

VOLATILITY AND DEVIATION OF DISTRIBUTED SOLAR VOLATILITY AND DEVIATION OF DISTRIBUTED SOLAR Andrew Goldstein Yale University 68 High Street New Haven, CT 06511 andrew.goldstein@yale.edu Alexander Thornton Shawn Kerrigan Locus Energy 657 Mission St.

More information

Fire, Forest History, and Ecological Restoration of Ponderosa Pine Forests at Mount Rushmore, South Dakota

Fire, Forest History, and Ecological Restoration of Ponderosa Pine Forests at Mount Rushmore, South Dakota Fire, Forest History, and Ecological Restoration of Ponderosa Pine Forests at Mount Rushmore, South Dakota Restoration uses the past not as a goal but as a reference point for the future...it is not to

More information

Lidar Remote Sensing for Forestry Applications

Lidar Remote Sensing for Forestry Applications Lidar Remote Sensing for Forestry Applications Ralph O. Dubayah* and Jason B. Drake** Department of Geography, University of Maryland, College Park, MD 0 *rdubayah@geog.umd.edu **jasdrak@geog.umd.edu 1

More information

Monte Carlo tests for spatial patterns and their change a

Monte Carlo tests for spatial patterns and their change a 1 Monte Carlo tests for spatial patterns and their change a Finnish Forest Research Institute Unioninkatu 40 A, 00170 Helsinki juha.heikkinen@metla.fi Workshop on Spatial Statistics and Ecology Perämeri

More information

Common Core Unit Summary Grades 6 to 8

Common Core Unit Summary Grades 6 to 8 Common Core Unit Summary Grades 6 to 8 Grade 8: Unit 1: Congruence and Similarity- 8G1-8G5 rotations reflections and translations,( RRT=congruence) understand congruence of 2 d figures after RRT Dilations

More information

http://www.jstor.org This content downloaded on Tue, 19 Feb 2013 17:28:43 PM All use subject to JSTOR Terms and Conditions

http://www.jstor.org This content downloaded on Tue, 19 Feb 2013 17:28:43 PM All use subject to JSTOR Terms and Conditions A Significance Test for Time Series Analysis Author(s): W. Allen Wallis and Geoffrey H. Moore Reviewed work(s): Source: Journal of the American Statistical Association, Vol. 36, No. 215 (Sep., 1941), pp.

More information

Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard

Algebra 1 2008. Academic Content Standards Grade Eight and Grade Nine Ohio. Grade Eight. Number, Number Sense and Operations Standard Academic Content Standards Grade Eight and Grade Nine Ohio Algebra 1 2008 Grade Eight STANDARDS Number, Number Sense and Operations Standard Number and Number Systems 1. Use scientific notation to express

More information

Prentice Hall Algebra 2 2011 Correlated to: Colorado P-12 Academic Standards for High School Mathematics, Adopted 12/2009

Prentice Hall Algebra 2 2011 Correlated to: Colorado P-12 Academic Standards for High School Mathematics, Adopted 12/2009 Content Area: Mathematics Grade Level Expectations: High School Standard: Number Sense, Properties, and Operations Understand the structure and properties of our number system. At their most basic level

More information

SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12

SENSITIVITY ANALYSIS AND INFERENCE. Lecture 12 This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Ripley s K function. Philip M. Dixon Volume 3, pp 1796 1803. Encyclopedia of Environmetrics (ISBN 0471 899976) Edited by

Ripley s K function. Philip M. Dixon Volume 3, pp 1796 1803. Encyclopedia of Environmetrics (ISBN 0471 899976) Edited by Ripley s K function Philip M. Dixon Volume 3, pp 1796 183 in Encyclopedia of Environmetrics (ISBN 471 899976) Edited by Abdel H. El-Shaarawi and Walter W. Piegorsch John Wiley & Sons, Ltd, Chichester,

More information

AN INTRODUCTION TO HYPOTHESIS-TESTING AND EXPERIMENTAL DESIGN

AN INTRODUCTION TO HYPOTHESIS-TESTING AND EXPERIMENTAL DESIGN GENERL EOLOGY IO 340 N INTRODUTION TO HYPOTHESIS-TESTING ND EXPERIMENTL DESIGN INTRODUTION Ecologists gain understanding of the natural world through rigorous application of the scientific method. s you

More information

MATH2740: Environmental Statistics

MATH2740: Environmental Statistics MATH2740: Environmental Statistics Lecture 6: Distance Methods I February 10, 2016 Table of contents 1 Introduction Problem with quadrat data Distance methods 2 Point-object distances Poisson process case

More information

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I

BNG 202 Biomechanics Lab. Descriptive statistics and probability distributions I BNG 202 Biomechanics Lab Descriptive statistics and probability distributions I Overview The overall goal of this short course in statistics is to provide an introduction to descriptive and inferential

More information

A new & widely applicable bedform tracking tool

A new & widely applicable bedform tracking tool A new & widely applicable bedform tracking tool C.F. van der Mark & A. Blom University of Twente Faculty of Engineering Technology Department of Water Engineering and Management P.O. Box 217, 75 AE Enschede,

More information

GEOENGINE MSc in Geomatics Engineering (Master Thesis) Anamelechi, Falasy Ebere

GEOENGINE MSc in Geomatics Engineering (Master Thesis) Anamelechi, Falasy Ebere Master s Thesis: ANAMELECHI, FALASY EBERE Analysis of a Raster DEM Creation for a Farm Management Information System based on GNSS and Total Station Coordinates Duration of the Thesis: 6 Months Completion

More information

Intro to GIS Winter 2011. Data Visualization Part I

Intro to GIS Winter 2011. Data Visualization Part I Intro to GIS Winter 2011 Data Visualization Part I Cartographer Code of Ethics Always have a straightforward agenda and have a defining purpose or goal for each map Always strive to know your audience

More information

NEW MEXICO Grade 6 MATHEMATICS STANDARDS

NEW MEXICO Grade 6 MATHEMATICS STANDARDS PROCESS STANDARDS To help New Mexico students achieve the Content Standards enumerated below, teachers are encouraged to base instruction on the following Process Standards: Problem Solving Build new mathematical

More information

How to Win the Stock Market Game

How to Win the Stock Market Game How to Win the Stock Market Game 1 Developing Short-Term Stock Trading Strategies by Vladimir Daragan PART 1 Table of Contents 1. Introduction 2. Comparison of trading strategies 3. Return per trade 4.

More information

WILDFLOWER RESTORATION PROJECT. Experimental Design and Data Collection Guide

WILDFLOWER RESTORATION PROJECT. Experimental Design and Data Collection Guide 1 Experimental Design and Data Collection Guide 2 INTRODUCTION This citizen science wildflower restoration project requires you to set up a study site, gather and plant seeds, and monitor changes in the

More information

How To Check For Differences In The One Way Anova

How To Check For Differences In The One Way Anova MINITAB ASSISTANT WHITE PAPER This paper explains the research conducted by Minitab statisticians to develop the methods and data checks used in the Assistant in Minitab 17 Statistical Software. One-Way

More information

The Research Proposal

The Research Proposal Describes the: The Research Proposal Researchable question itself Why it's important (i.e., the rationale and significance of your research) Propositions that are known or assumed to be true (i.e., axioms

More information

Inventory Enhancements

Inventory Enhancements Inventory Enhancements Chapter 6 - Vegetation Inventory Standards and Data Model Documents Resource Information Management Branch, Table of Contents 1. DETECTION OF CONIFEROUS UNDERSTOREY UNDER DECIDUOUS

More information

Fairfield Public Schools

Fairfield Public Schools Mathematics Fairfield Public Schools AP Statistics AP Statistics BOE Approved 04/08/2014 1 AP STATISTICS Critical Areas of Focus AP Statistics is a rigorous course that offers advanced students an opportunity

More information

NASA Explorer Schools Pre-Algebra Unit Lesson 2 Student Workbook. Solar System Math. Comparing Mass, Gravity, Composition, & Density

NASA Explorer Schools Pre-Algebra Unit Lesson 2 Student Workbook. Solar System Math. Comparing Mass, Gravity, Composition, & Density National Aeronautics and Space Administration NASA Explorer Schools Pre-Algebra Unit Lesson 2 Student Workbook Solar System Math Comparing Mass, Gravity, Composition, & Density What interval of values

More information

Hypothesis Testing for Beginners

Hypothesis Testing for Beginners Hypothesis Testing for Beginners Michele Piffer LSE August, 2011 Michele Piffer (LSE) Hypothesis Testing for Beginners August, 2011 1 / 53 One year ago a friend asked me to put down some easy-to-read notes

More information

There are three kinds of people in the world those who are good at math and those who are not. PSY 511: Advanced Statistics for Psychological and Behavioral Research 1 Positive Views The record of a month

More information

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing.

Introduction to. Hypothesis Testing CHAPTER LEARNING OBJECTIVES. 1 Identify the four steps of hypothesis testing. Introduction to Hypothesis Testing CHAPTER 8 LEARNING OBJECTIVES After reading this chapter, you should be able to: 1 Identify the four steps of hypothesis testing. 2 Define null hypothesis, alternative

More information

Settlement of Precast Culverts Under High Fills; The Influence of Construction Sequence and Structural Effects of Longitudinal Strains

Settlement of Precast Culverts Under High Fills; The Influence of Construction Sequence and Structural Effects of Longitudinal Strains Settlement of Precast Culverts Under High Fills; The Influence of Construction Sequence and Structural Effects of Longitudinal Strains Doug Jenkins 1, Chris Lawson 2 1 Interactive Design Services, 2 Reinforced

More information

Environmental Remote Sensing GEOG 2021

Environmental Remote Sensing GEOG 2021 Environmental Remote Sensing GEOG 2021 Lecture 4 Image classification 2 Purpose categorising data data abstraction / simplification data interpretation mapping for land cover mapping use land cover class

More information

EXPERIMENTAL ERROR AND DATA ANALYSIS

EXPERIMENTAL ERROR AND DATA ANALYSIS EXPERIMENTAL ERROR AND DATA ANALYSIS 1. INTRODUCTION: Laboratory experiments involve taking measurements of physical quantities. No measurement of any physical quantity is ever perfectly accurate, except

More information

CHAPTER 7 TRAVERSE Section I. SELECTION OF TRAVERSE DEFINITION

CHAPTER 7 TRAVERSE Section I. SELECTION OF TRAVERSE DEFINITION CHAPTER 7 TRAVERSE Section I. SELECTION OF TRAVERSE DEFINITION A traverse is a series of straight lines called traverse legs. The surveyor uses them to connect a series of selected points called traverse

More information

Your Defensible Space Slideshow

Your Defensible Space Slideshow Your Defensible Space Slideshow Red = Trees to Remove Your Defensible Space Slideshow This slideshow was created to highlight actions you can take to dramatically improve the chances of your home surviving

More information

ECONOMIC INJURY LEVEL (EIL) AND ECONOMIC THRESHOLD (ET) CONCEPTS IN PEST MANAGEMENT. David G. Riley University of Georgia Tifton, Georgia, USA

ECONOMIC INJURY LEVEL (EIL) AND ECONOMIC THRESHOLD (ET) CONCEPTS IN PEST MANAGEMENT. David G. Riley University of Georgia Tifton, Georgia, USA ECONOMIC INJURY LEVEL (EIL) AND ECONOMIC THRESHOLD (ET) CONCEPTS IN PEST MANAGEMENT David G. Riley University of Georgia Tifton, Georgia, USA One of the fundamental concepts of integrated pest management

More information

2. Simple Linear Regression

2. Simple Linear Regression Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according

More information

Vertical Alignment Colorado Academic Standards 6 th - 7 th - 8 th

Vertical Alignment Colorado Academic Standards 6 th - 7 th - 8 th Vertical Alignment Colorado Academic Standards 6 th - 7 th - 8 th Standard 3: Data Analysis, Statistics, and Probability 6 th Prepared Graduates: 1. Solve problems and make decisions that depend on un

More information

Composite performance measures in the public sector Rowena Jacobs, Maria Goddard and Peter C. Smith

Composite performance measures in the public sector Rowena Jacobs, Maria Goddard and Peter C. Smith Policy Discussion Briefing January 27 Composite performance measures in the public sector Rowena Jacobs, Maria Goddard and Peter C. Smith Introduction It is rare to open a newspaper or read a government

More information

7 CONTINUOUS PROBABILITY DISTRIBUTIONS

7 CONTINUOUS PROBABILITY DISTRIBUTIONS 7 CONTINUOUS PROBABILITY DISTRIBUTIONS Chapter 7 Continuous Probability Distributions Objectives After studying this chapter you should understand the use of continuous probability distributions and the

More information

Diagrams and Graphs of Statistical Data

Diagrams and Graphs of Statistical Data Diagrams and Graphs of Statistical Data One of the most effective and interesting alternative way in which a statistical data may be presented is through diagrams and graphs. There are several ways in

More information

Multivariate Analysis of Ecological Data

Multivariate Analysis of Ecological Data Multivariate Analysis of Ecological Data MICHAEL GREENACRE Professor of Statistics at the Pompeu Fabra University in Barcelona, Spain RAUL PRIMICERIO Associate Professor of Ecology, Evolutionary Biology

More information

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012 Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization GENOME 560, Spring 2012 Data are interesting because they help us understand the world Genomics: Massive Amounts

More information

Simple Random Sampling

Simple Random Sampling Source: Frerichs, R.R. Rapid Surveys (unpublished), 2008. NOT FOR COMMERCIAL DISTRIBUTION 3 Simple Random Sampling 3.1 INTRODUCTION Everyone mentions simple random sampling, but few use this method for

More information

Chapter 23. Inferences for Regression

Chapter 23. Inferences for Regression Chapter 23. Inferences for Regression Topics covered in this chapter: Simple Linear Regression Simple Linear Regression Example 23.1: Crying and IQ The Problem: Infants who cry easily may be more easily

More information

An Analysis of the Rossby Wave Theory

An Analysis of the Rossby Wave Theory An Analysis of the Rossby Wave Theory Morgan E. Brown, Elise V. Johnson, Stephen A. Kearney ABSTRACT Large-scale planetary waves are known as Rossby waves. The Rossby wave theory gives us an idealized

More information

Northumberland Knowledge

Northumberland Knowledge Northumberland Knowledge Know Guide How to Analyse Data - November 2012 - This page has been left blank 2 About this guide The Know Guides are a suite of documents that provide useful information about

More information

Measurement with Ratios

Measurement with Ratios Grade 6 Mathematics, Quarter 2, Unit 2.1 Measurement with Ratios Overview Number of instructional days: 15 (1 day = 45 minutes) Content to be learned Use ratio reasoning to solve real-world and mathematical

More information

Assume you have 40 acres of forestland that was

Assume you have 40 acres of forestland that was A l a b a m a A & M a n d A u b u r n U n i v e r s i t i e s ANR-1371 Basal Area: A Measure Made for Management Assume you have 40 acres of forestland that was recently assessed by a natural resource

More information

Agriculture, Fisheries and Conservation Department Conservation Branch. Nature Conservation Practice Note No. 02 Revised: June 2006

Agriculture, Fisheries and Conservation Department Conservation Branch. Nature Conservation Practice Note No. 02 Revised: June 2006 Agriculture, Fisheries and Conservation Department Conservation Branch Nature Conservation Practice Note No. 02 Revised: June 2006 Ref: AF GR CON 21/2 Measurement of Diameter at Breast Height (DBH) 1.

More information

Week 4: Standard Error and Confidence Intervals

Week 4: Standard Error and Confidence Intervals Health Sciences M.Sc. Programme Applied Biostatistics Week 4: Standard Error and Confidence Intervals Sampling Most research data come from subjects we think of as samples drawn from a larger population.

More information

Model Virginia Map Accuracy Standards Guideline

Model Virginia Map Accuracy Standards Guideline Commonwealth of Virginia Model Virginia Map Accuracy Standards Guideline Virginia Information Technologies Agency (VITA) Publication Version Control Publication Version Control: It is the user's responsibility

More information

John F. Cotton College of Architecture & Environmental Design California Polytechnic State University San Luis Obispo, California JOHN F.

John F. Cotton College of Architecture & Environmental Design California Polytechnic State University San Luis Obispo, California JOHN F. SO L I DMO D E L I N GAS A TO O LFO RCO N S T RU C T I N SO G LA REN V E LO PE S by John F. Cotton College of Architecture & Environmental Design California Polytechnic State University San Luis Obispo,

More information

LiDAR for vegetation applications

LiDAR for vegetation applications LiDAR for vegetation applications UoL MSc Remote Sensing Dr Lewis plewis@geog.ucl.ac.uk Introduction Introduction to LiDAR RS for vegetation Review instruments and observational concepts Discuss applications

More information

Regents Biology LAB. STUDY OF POPULATION DENSITY ON A SUBURBAN LAWN

Regents Biology LAB. STUDY OF POPULATION DENSITY ON A SUBURBAN LAWN Period Date LAB. STUDY OF POPULATION DENSITY ON A SUBURBAN LAWN Ecological communities are built on the interactions between the creatures (both plants and animals) that live there and the physical environment

More information

Global Seasonal Phase Lag between Solar Heating and Surface Temperature

Global Seasonal Phase Lag between Solar Heating and Surface Temperature Global Seasonal Phase Lag between Solar Heating and Surface Temperature Summer REU Program Professor Tom Witten By Abstract There is a seasonal phase lag between solar heating from the sun and the surface

More information

Imputing Values to Missing Data

Imputing Values to Missing Data Imputing Values to Missing Data In federated data, between 30%-70% of the data points will have at least one missing attribute - data wastage if we ignore all records with a missing value Remaining data

More information

How To Write A Data Analysis

How To Write A Data Analysis Mathematics Probability and Statistics Curriculum Guide Revised 2010 This page is intentionally left blank. Introduction The Mathematics Curriculum Guide serves as a guide for teachers when planning instruction

More information

United States Standards for Grades of Christmas Trees

United States Standards for Grades of Christmas Trees United States Department of Agriculture Agricultural Marketing Service Fruit and Vegetable Programs United States Standards for Grades of Christmas Trees Fresh Products Branch Effective October 30, 1989

More information

Defining the Beginning: The Importance of Research Design

Defining the Beginning: The Importance of Research Design Research and Management Techniques for the Conservation of Sea Turtles K. L. Eckert, K. A. Bjorndal, F. A. Abreu-Grobois, M. Donnelly (Editors) IUCN/SSC Marine Turtle Specialist Group Publication No. 4,

More information

FORESTED VEGETATION. forests by restoring forests at lower. Prevent invasive plants from establishing after disturbances

FORESTED VEGETATION. forests by restoring forests at lower. Prevent invasive plants from establishing after disturbances FORESTED VEGETATION Type of strategy Protect General cold adaptation upland and approach subalpine forests by restoring forests at lower Specific adaptation action Thin dry forests to densities low enough

More information

(1) 2 TEST SETUP. Table 1 Summary of models used for calculating roughness parameters Model Published z 0 / H d/h

(1) 2 TEST SETUP. Table 1 Summary of models used for calculating roughness parameters Model Published z 0 / H d/h Estimation of Surface Roughness using CFD Simulation Daniel Abdi a, Girma T. Bitsuamlak b a Research Assistant, Department of Civil and Environmental Engineering, FIU, Miami, FL, USA, dabdi001@fiu.edu

More information

MATHS LEVEL DESCRIPTORS

MATHS LEVEL DESCRIPTORS MATHS LEVEL DESCRIPTORS Number Level 3 Understand the place value of numbers up to thousands. Order numbers up to 9999. Round numbers to the nearest 10 or 100. Understand the number line below zero, and

More information

Removal fishing to estimate catch probability: preliminary data analysis

Removal fishing to estimate catch probability: preliminary data analysis Removal fishing to estimate catch probability: preliminary data analysis Raymond A. Webster Abstract This project examined whether or not removal sampling was a useful technique for estimating catch probability

More information

SIMPLIFIED PERFORMANCE MODEL FOR HYBRID WIND DIESEL SYSTEMS. J. F. MANWELL, J. G. McGOWAN and U. ABDULWAHID

SIMPLIFIED PERFORMANCE MODEL FOR HYBRID WIND DIESEL SYSTEMS. J. F. MANWELL, J. G. McGOWAN and U. ABDULWAHID SIMPLIFIED PERFORMANCE MODEL FOR HYBRID WIND DIESEL SYSTEMS J. F. MANWELL, J. G. McGOWAN and U. ABDULWAHID Renewable Energy Laboratory Department of Mechanical and Industrial Engineering University of

More information

Exercise 1.12 (Pg. 22-23)

Exercise 1.12 (Pg. 22-23) Individuals: The objects that are described by a set of data. They may be people, animals, things, etc. (Also referred to as Cases or Records) Variables: The characteristics recorded about each individual.

More information

Tutorial 5: Hypothesis Testing

Tutorial 5: Hypothesis Testing Tutorial 5: Hypothesis Testing Rob Nicholls nicholls@mrc-lmb.cam.ac.uk MRC LMB Statistics Course 2014 Contents 1 Introduction................................ 1 2 Testing distributional assumptions....................

More information

Maximum Range Explained range Figure 1 Figure 1: Trajectory Plot for Angled-Launched Projectiles Table 1

Maximum Range Explained range Figure 1 Figure 1: Trajectory Plot for Angled-Launched Projectiles Table 1 Maximum Range Explained A projectile is an airborne object that is under the sole influence of gravity. As it rises and falls, air resistance has a negligible effect. The distance traveled horizontally

More information

Tomlinson, R. F. 1962 Computer Mapping: An Introduction to the Use of Electronic Computers In the Storage, Compilation and Assessment of Natural and

Tomlinson, R. F. 1962 Computer Mapping: An Introduction to the Use of Electronic Computers In the Storage, Compilation and Assessment of Natural and Tomlinson, R. F. 1962 Computer Mapping: An Introduction to the Use of Electronic Computers In the Storage, Compilation and Assessment of Natural and Economic Data for the Evaluation of Marginal Lands Report

More information

On Correlating Performance Metrics

On Correlating Performance Metrics On Correlating Performance Metrics Yiping Ding and Chris Thornley BMC Software, Inc. Kenneth Newman BMC Software, Inc. University of Massachusetts, Boston Performance metrics and their measurements are

More information

Lesson 9: Introduction to the Landscape Management System (LMS)

Lesson 9: Introduction to the Landscape Management System (LMS) Lesson 9: Introduction to the Landscape Management System (LMS) Review and Introduction In earlier lessons, you learned how to establish and take measurements in sample inventory plots. In Lesson 8, you

More information

BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS

BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-110 012 seema@iasri.res.in Genomics A genome is an organism s

More information

Stand Density Management Diagram

Stand Density Management Diagram H O W T O U S E A Stand Density Management Diagram Yield predictions for a spacing prescription Ministry of Forests Forest Practices Branch Canadian Cataloguing in Publication Data Main entry under title:

More information

Summarizing and Displaying Categorical Data

Summarizing and Displaying Categorical Data Summarizing and Displaying Categorical Data Categorical data can be summarized in a frequency distribution which counts the number of cases, or frequency, that fall into each category, or a relative frequency

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Primer Descriptive statistics Central tendency Variation Relative position Relationships Calculating descriptive statistics Descriptive Statistics Purpose to describe or summarize

More information

APPENDIX N. Data Validation Using Data Descriptors

APPENDIX N. Data Validation Using Data Descriptors APPENDIX N Data Validation Using Data Descriptors Data validation is often defined by six data descriptors: 1) reports to decision maker 2) documentation 3) data sources 4) analytical method and detection

More information

Name: Date: Use the following to answer questions 3-4:

Name: Date: Use the following to answer questions 3-4: Name: Date: 1. Determine whether each of the following statements is true or false. A) The margin of error for a 95% confidence interval for the mean increases as the sample size increases. B) The margin

More information

Coastwide Reference Monitoring System Wetlands (CRMS-Wetlands) Project Update

Coastwide Reference Monitoring System Wetlands (CRMS-Wetlands) Project Update Coastwide Reference Monitoring System Wetlands (CRMS-Wetlands) Project Update Louisiana Department of Natural Resources Coastal Restoration Division U.S. Geological Survey National Wetlands Research Center

More information

OUTLIER ANALYSIS. Data Mining 1

OUTLIER ANALYSIS. Data Mining 1 OUTLIER ANALYSIS Data Mining 1 What Are Outliers? Outlier: A data object that deviates significantly from the normal objects as if it were generated by a different mechanism Ex.: Unusual credit card purchase,

More information

Drawing a histogram using Excel

Drawing a histogram using Excel Drawing a histogram using Excel STEP 1: Examine the data to decide how many class intervals you need and what the class boundaries should be. (In an assignment you may be told what class boundaries to

More information

monitoring damage und perceiving risk under climate change impacts

monitoring damage und perceiving risk under climate change impacts Risk assessment and perception monitoring damage und perceiving risk under climate change impacts Marc Hanewinkel Swiss Federal Institute for Forest, Snow and Landscape Research WSL Research Unit: Forest

More information

Portland Trees www.portlandoregon.gov/trees - permits, tree removal, report a downed tree.

Portland Trees www.portlandoregon.gov/trees - permits, tree removal, report a downed tree. MEMORANDUM TO: Mike Abbaté, PP&R Director FROM: Jenn Cairo, City Forester DATE: 10/6/2015 RE: Mitigating the removal of large trees in development situations INTRODUCTION In the wake of recent public attention

More information

Visualizing Data. Contents. 1 Visualizing Data. Anthony Tanbakuchi Department of Mathematics Pima Community College. Introductory Statistics Lectures

Visualizing Data. Contents. 1 Visualizing Data. Anthony Tanbakuchi Department of Mathematics Pima Community College. Introductory Statistics Lectures Introductory Statistics Lectures Visualizing Data Descriptive Statistics I Department of Mathematics Pima Community College Redistribution of this material is prohibited without written permission of the

More information

GROWTH POTENTIAL OF LOBLOLLY PINE PLANTATIONS IN THE GEORGIA PIEDMONT: A SPACING STUDY EXAMPLE

GROWTH POTENTIAL OF LOBLOLLY PINE PLANTATIONS IN THE GEORGIA PIEDMONT: A SPACING STUDY EXAMPLE GROWTH POTENTIAL OF LOBLOLLY PINE PLANTATIONS IN THE GEORGIA PIEDMONT: A SPACING STUDY EXAMPLE Plantation Management Research Cooperative Daniel B. Warnell School of Forest Resources University of Georgia

More information

The LAndscape Management Policy Simulator (LAMPS) Pete Bettinger Department of Forest Resources Oregon State University

The LAndscape Management Policy Simulator (LAMPS) Pete Bettinger Department of Forest Resources Oregon State University The LAndscape Management Policy Simulator (LAMPS) Pete Bettinger Department of Forest Resources Oregon State University CLAMS Landscape Planning and Analysis Process Existing forest inventories Management

More information

Jiří Matas. Hough Transform

Jiří Matas. Hough Transform Hough Transform Jiří Matas Center for Machine Perception Department of Cybernetics, Faculty of Electrical Engineering Czech Technical University, Prague Many slides thanks to Kristen Grauman and Bastian

More information

Model-based Synthesis. Tony O Hagan

Model-based Synthesis. Tony O Hagan Model-based Synthesis Tony O Hagan Stochastic models Synthesising evidence through a statistical model 2 Evidence Synthesis (Session 3), Helsinki, 28/10/11 Graphical modelling The kinds of models that

More information

Robust procedures for Canadian Test Day Model final report for the Holstein breed

Robust procedures for Canadian Test Day Model final report for the Holstein breed Robust procedures for Canadian Test Day Model final report for the Holstein breed J. Jamrozik, J. Fatehi and L.R. Schaeffer Centre for Genetic Improvement of Livestock, University of Guelph Introduction

More information

Introduction to nonparametric regression: Least squares vs. Nearest neighbors

Introduction to nonparametric regression: Least squares vs. Nearest neighbors Introduction to nonparametric regression: Least squares vs. Nearest neighbors Patrick Breheny October 30 Patrick Breheny STA 621: Nonparametric Statistics 1/16 Introduction For the remainder of the course,

More information

Problem of the Month Through the Grapevine

Problem of the Month Through the Grapevine The Problems of the Month (POM) are used in a variety of ways to promote problem solving and to foster the first standard of mathematical practice from the Common Core State Standards: Make sense of problems

More information

Self-adjusting Importances for the Acceleration of MCBEND

Self-adjusting Importances for the Acceleration of MCBEND Self-adjusting Importances for the Acceleration of MCBEND Edmund Shuttleworth Abstract The principal method of variance reduction in MCBEND is the use of splitting and Russian roulette under the control

More information

Session 7 Bivariate Data and Analysis

Session 7 Bivariate Data and Analysis Session 7 Bivariate Data and Analysis Key Terms for This Session Previously Introduced mean standard deviation New in This Session association bivariate analysis contingency table co-variation least squares

More information

INTRODUCTION TO ERRORS AND ERROR ANALYSIS

INTRODUCTION TO ERRORS AND ERROR ANALYSIS INTRODUCTION TO ERRORS AND ERROR ANALYSIS To many students and to the public in general, an error is something they have done wrong. However, in science, the word error means the uncertainty which accompanies

More information

THE BASICS OF STATISTICAL PROCESS CONTROL & PROCESS BEHAVIOUR CHARTING

THE BASICS OF STATISTICAL PROCESS CONTROL & PROCESS BEHAVIOUR CHARTING THE BASICS OF STATISTICAL PROCESS CONTROL & PROCESS BEHAVIOUR CHARTING A User s Guide to SPC By David Howard Management-NewStyle "...Shewhart perceived that control limits must serve industry in action.

More information

Chapter 14 Managing Operational Risks with Bayesian Networks

Chapter 14 Managing Operational Risks with Bayesian Networks Chapter 14 Managing Operational Risks with Bayesian Networks Carol Alexander This chapter introduces Bayesian belief and decision networks as quantitative management tools for operational risks. Bayesian

More information

How To Run Statistical Tests in Excel

How To Run Statistical Tests in Excel How To Run Statistical Tests in Excel Microsoft Excel is your best tool for storing and manipulating data, calculating basic descriptive statistics such as means and standard deviations, and conducting

More information

Algebra 1 Course Information

Algebra 1 Course Information Course Information Course Description: Students will study patterns, relations, and functions, and focus on the use of mathematical models to understand and analyze quantitative relationships. Through

More information

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4)

Summary of Formulas and Concepts. Descriptive Statistics (Ch. 1-4) Summary of Formulas and Concepts Descriptive Statistics (Ch. 1-4) Definitions Population: The complete set of numerical information on a particular quantity in which an investigator is interested. We assume

More information

Quantitative Methods for Finance

Quantitative Methods for Finance Quantitative Methods for Finance Module 1: The Time Value of Money 1 Learning how to interpret interest rates as required rates of return, discount rates, or opportunity costs. 2 Learning how to explain

More information

Inv 1 5. Draw 2 different shapes, each with an area of 15 square units and perimeter of 16 units.

Inv 1 5. Draw 2 different shapes, each with an area of 15 square units and perimeter of 16 units. Covering and Surrounding: Homework Examples from ACE Investigation 1: Questions 5, 8, 21 Investigation 2: Questions 6, 7, 11, 27 Investigation 3: Questions 6, 8, 11 Investigation 5: Questions 15, 26 ACE

More information

Predicting Market Value of Soccer Players Using Linear Modeling Techniques

Predicting Market Value of Soccer Players Using Linear Modeling Techniques 1 Predicting Market Value of Soccer Players Using Linear Modeling Techniques by Yuan He Advisor: David Aldous Index Introduction ----------------------------------------------------------------------------

More information

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),

More information